PAGE: 1 RAW SEQUENCE LISTING DATE: 10/22/1999 

PATENT APPLICATION US/09/417,251 TIME: 15:02:48

Input Set: 1417251. RAW 



This Raw Listing contains the General Information 
Section and up to first 5 pages.



1 <110> APPLICANT: Cahoon, Rebecca E. 

2 Miao, Guo-Hua 

3 Herrman, Rafael 

4 Rafalski, Antoni 

5 McCutchen, Bill F. 

6 <120> TITLE OF INVENTION: Plant Protein Disulfide Isomerases 

7 <130> FILE REFERENCE: BB1085 US NA 

8 <140> CURRENT APPLICATION NUMBER: US/09/417,251

9 <141> CURRENT FILING DATE: 1999-10-13 

10 <150> EARLIER APPLICATION NUMBER: 60/049,408 

11 <151> EARLIER FILING DATE: 1998-10-15 

12 <160> NUMBER OF SEQ ID NOS: 20

13 <170> SOFTWARE: Microsoft Office 97 

14 <210> SEQ ID NO 1 

15 <211> LENGTH: 504 

16 <212> TYPE: DNA 

17 <213> ORGANISM: Zea mays 

18 <220> FEATURE:

19 <221> NAME/KEY: unsure

20 <222> LOCATION: (463)

21 <220> FEATURE:

22 <221> NAME/KEY: unsure

23 <222> LOCATION: (469)

24 <220> FEATURE:

25 <221> NAME/KEY: unsure

26 <222> LOCATION: (471)

27 <220> FEATURE:

28 <221> NAME/KEY: unsure

29 <222> LOCATION: (496)

30 <400> SEQUENCE: 1

31 tgcctgccct gtcctgtcct gttcagcgga accttctctt tgtgttttat aggttacccc 60 

32 gtcaaaaaga cagcccatca tgcaccacaa gaagatcgcc tgcagcttca tggctgctct 120

33 ggctgcctat gcctctgctg ccgactcaga tgttcatcag ctaaccaagg acaccttcga 180 

34 ggagtttgtc aagtccaaca atctcgtcct cgctgagttc tttgctccct ggtgcggtca 240 

35 ctgcaaggcc ctcgcccccg agtacgagga ggccgccaca actctcaagg agaagaacat 300 

36 caagcttgcc aagattgact gcactgagga gtccgacctc tgcaaagacc agggcgtcga 360 

37 gggttacccc accctcaagg tcttccgtgg tcttgacaat gtcactccct actctggcca 420 
W--> 38 gcgtaaggcc gctggtatca ttctacatga ttaagagttc ctncccggi^r nttcatttta 480

W--> 39 caaagggaac cctcgngggt ttaa 504

40 <210> SEQ ID NO 2 

41 <211> LENGTH: 110 

42 <212> TYPE: PRT 

43 <213> ORGANISM: Zea mays 

44 <400> SEQUENCE: 2 

45 Met Ala Ala Leu Ala Ala Tyr Ala Ser Ala Ala Asp Ser Asp Val His

46 1 5 10 15

47 Gln Leu Thr Lys Asp Thr Phe Glu Glu Phe Val Lys Ser Asn Asn Leu

48 20 25 30

49 Val Leu Ala Glu Phe Phe Ala Pro Trp Cys Gly His Cys Lys Ala Leu

50 35 40 45

51 Ala Pro Glu Tyr Glu Glu Ala Ala Thr Thr Leu Lys Glu Lys Asn Ile

52 50 55 60

53 Lys Leu Ala Lys Ile Asp Cys Thr Glu Glu Ser Asp Leu Cys Lys Asp

54 65 70 75 80

55 Gln Gly Val Glu Gly Tyr Pro Thr Leu Lys Val Phe Arg Gly Leu Asp

56 85 90 95

57 Asn Val Thr Pro Tyr Ser Gly Gln Arg Lys Ala Ala Gly Ile

58 100 105 110

59 <210> SEQ ID NO 3

60 <211> LENGTH: 505

61 <212> TYPE: DNA

62 <213> ORGANISM: Glycine max

63 <220> FEATURE:

64 <221> NAME/KEY: unsure

65 <222> LOCATION: (503)

66 <400> SEQUENCE: 3

67 tctttctggt actccacctg gtattgttgt tgaagatcgt aataccaata aaaattatgt 60

68 ttatccacaa gctaatgaaa ttactgaaga tgcattacgt gcacatttac aaggttatgt 120

69 tgatggtaca cttcaaccca ctgtcaaatc tgaagaaatc ccagaaaaac aagatggtcc 180

70 agtttatgta ctcgtgggta aaaattttga atccattgtt atggatgaaa ctaaagatgt 240

71 attagttgaa ttttatgcac catggtgtgg acattgtaaa acattagctc ccaaatacga 300

72 tgcattaggt gaatcattca agtcaaaccc caatgtcatt attgccaaga ttgatgccac 360

73 tgcaaatgat acccctgttg atattcaagg tttccccact attatctatt ggccagctaa 420

74 taataagaaa aatccaatta catatgaagg tgaacgtact gaatcagcac ttgctgcatt 480

75 tgtacgtgaa aaatggtcaa cantt 505


76 <210> SEQ ID NO 4

77 <211> LENGTH: 158

78 <212> TYPE: PRT

79 <213> ORGANISM: Glycine max

80 <400> SEQUENCE: 4

81 Pro Gly Ile Val Val Glu Asp Arg Asn Thr Asn Lys Asn Tyr Val Tyr

82 1 5 10 15

83 Pro Gln Ala Asn Glu Ile Thr Glu Asp Ala Leu Arg Ala His Leu Gln

84 20 25 30

85 Gly Tyr Val Asp Gly Thr Leu Gln Pro Thr Val Lys Ser Glu Glu Ile

86 35 40 45

87 Pro Glu Lys Gln Asp Gly Pro Val Tyr Val Leu Val Gly Lys Asn Phe

88 50 55 60

89 Glu Ser Ile Val Met Asp Glu Thr Lys Asp Val Leu Val Glu Phe Tyr

90 65 70 75 80

91 Ala Pro Trp Cys Gly His Cys Lys Thr Leu Ala Pro Lys Tyr Asp Ala

92 85 90 95

93 Leu Gly Glu Ser Phe Lys Ser Asn Pro Asn Val Ile Ile Ala Lys Ile

94 100 105 110

95 Asp Ala Thr Ala Asn Asp Thr Pro Val Asp Ile Gln Gly Phe Pro Thr

96 115 120 125 

97 Ile Ile Tyr Trp Pro Ala Asn Asn Lys Lys Asn Pro Ile Thr Tyr Glu

98 130 135 140 

99 Gly Glu Arg Thr Glu Ser Ala Leu Ala Ala Phe Val Arg Glu 

100 145 150 155 

101 <210> SEQ ID NO 5 

102 <211> LENGTH: 1692 

103 <212> TYPE: DNA 

104 <213> ORGANISM: Zea mays 

105 <400> SEQUENCE: 5 

106 gcacgaggcg cggcggagat cgaatcgagc gcccgccacg gcgatggcga ctagagtcct 60 

107 gccgccggct ctgctctctt tcatactcct cctgctgctc tcgctctcag cccgcgacac 120 

108 cgtcgccgcg ggcgaggatt tcccacgcga cgggcgggtg atcgacctcg acgacagcaa 180 

109 tttcgaggcg gcgctgggcg ccatcgactt tctcttcgtc gacttctacg ccccttggtg 240 

110 cggccactgc aagagacttg cgcccgagtt agatgaagct gcaccggtgt tgtcagggtt 300

111 gagtgagcct attgttgttg ccaaagtcaa cgctgataaa tacagaaaac tcggatcaaa 360 

112 atatggagtg gatgggttcc ctaccctcat gctctttatc catggtgttc caattgaata 420

113 cactggttcg aggaaagctg accagcttgt ccgcaatctg aagaagttcg tttcgccaga 480 

114 tgtttctatc cttgagtcag attctgcgat aaagaacttt gttgagaatg ctgggataag 540 

115 ctttccgata ttccttggtt ttggggtgaa tgactcattg attgctgagt atggaaggaa 600

116 atacaagaaa agagcctggt ttgctgttgc taaagatttc tctgaggaca tcatggtagc 660 

117 ctatgaattt gataaggttc cagcactagt tgctatccat ccaaagtata aggaacagag 720

118 tttgttctat ggcccatttg aagaaaattt cttagaagat tttgtacggc aatcccttct 780 

119 ccctttggtt gtcccaatca atacagagac actaaaaatg ctgaatgatg atcagaggaa 840 

120 agttgttctc acaattttgg aggatgattc agatgaaaac tctacgcaac tggtaaagat 900 

121 tttgcgatct gctgctaatg caaaccgtga tttggtgttt ggatatgttg gaatcaagca 960 

122 atgggatggg tttgtggaga cttttgatgt ttccaagagc tcacagctgc caaagctact 1020 

123 tgtgtgggat agagatgagg agtatgagct agtggatggt tcagagagat tagaagaagg 1080 

124 tgaccaagca tctcaaataa gccaattcct tgagggatac agagcaggaa gaacaacaaa 1140 

125 gaagaaaatc accggccctt ctttcatggg tttcctgaac tctctggtca gcctgaactc 1200 

126 gctgtacatc cttatatttg tcatcgccct tctgtttgtc atggtgtact ttgctgggca 1260 

127 agatgatact cctcagccaa gacgaattca cgaagagtga tgaaagcttg ttgggcttct 1320

128 tgcacctaaa gatggctaat ctaccgggag attagctttt gtattaattg tacaaaagct 1380 

129 tcaactgacg caagtcgtga agagtggttt tggcaatttg gccattcatg ctgagtttct 1440 

130 tcaatctcta ttggcgacat caatttctgc atcctgccta tttgtgtttc tgctttgtgc 1500 

131 ccttcaattt gttctttaat ttagagctta gaaattagcc tctgcctgtg tattctggaa 1560 

132 cctgccattc cagagtccat ttctgtgaaa atatatttat tattatcata ctctgctacc 1620 

133 gagcttttgt acaattaata caggatatat agactgttct ggtgcacaaa aaaaaaaaga 1680 

134 aaaaaaaaaa aa 1692 

135 <210> SEQ ID NO 6 

136 <211> LENGTH: 418 

137 <212> TYPE: PRT 

138 <213> ORGANISM: Zea mays 
139 <400> SEQUENCE: 6

140 Met Ala Thr Arg Val Leu Pro Pro Ala Leu Leu Ser Phe Ile Leu Leu

141 1 5 10 15

142 Leu Leu Leu Ser Leu Ser Ala Arg Asp Thr Val Ala Ala Gly Glu Asp 

143 20 25 30 

144 Phe Pro Arg Asp Gly Arg Val Ile Asp Leu Asp Asp Ser Asn Phe Glu

145 35 40 45

146 Ala Ala Leu Gly Ala Ile Asp Phe Leu Phe Val Asp Phe Tyr Ala Pro

147 50 55 60

148 Trp Cys Gly His Cys Lys Arg Leu Ala Pro Glu Leu Asp Glu Ala Ala

149 65 70 75 80

150 Pro Val Leu Ser Gly Leu Ser Glu Pro Ile Val Val Ala Lys Val Asn

151 85 90 95

152 Ala Asp Lys Tyr Arg Lys Leu Gly Ser Lys Tyr Gly Val Asp Gly Phe

153 100 105 110

154 Pro Thr Leu Met Leu Phe Ile His Gly Val Pro Ile Glu Tyr Thr Gly

155 115 120 125

156 Ser Arg Lys Ala Asp Gln Leu Val Arg Asn Leu Lys Lys Phe Val Ser

157 130 135 140

158 Pro Asp Val Ser Ile Leu Glu Ser Asp Ser Ala Ile Lys Asn Phe Val

159 145 150 155 160

160 Glu Asn Ala Gly Ile Ser Phe Pro Ile Phe Leu Gly Phe Gly Val Asn


161 165 170 175

162 Asp Ser Leu Ile Ala Glu Tyr Gly Arg Lys Tyr Lys Lys Arg Ala Trp

163 180 185 190

164 Phe Ala Val Ala Lys Asp Phe Ser Glu Asp Ile Met Val Ala Tyr Glu

165 195 200 205

166 Phe Asp Lys Val Pro Ala Leu Val Ala Ile His Pro Lys Tyr Lys Glu

167 210 215 220

168 Gln Ser Leu Phe Tyr Gly Pro Phe Glu Glu Asn Phe Leu Glu Asp Phe

169 225 230 235 240

170 Val Arg Gln Ser Leu Leu Pro Leu Val Val Pro Ile Asn Thr Glu Thr

171 245 250 255

172 Leu Lys Met Leu Asn Asp Asp Gln Arg Lys Val Val Leu Thr Ile Leu

173 260 265 270

174 Glu Asp Asp Ser Asp Glu Asn Ser Thr Gln Leu Val Lys Ile Leu Arg

175 275 280 285

176 Ser Ala Ala Asn Ala Asn Arg Asp Leu Val Phe Gly Tyr Val Gly Ile


177 290 295 300

178 Lys Gln Trp Asp Gly Phe Val Glu Thr Phe Asp Val Ser Lys Ser Ser

179 305 310 315 320

180 Gln Leu Pro Lys Leu Leu Val Trp Asp Arg Asp Glu Glu Tyr Glu Leu

181 325 330 335

182 Val Asp Gly Ser Glu Arg Leu Glu Glu Gly Asp Gln Ala Ser Gln Ile

183 340 345 350

184 Ser Gln Phe Leu Glu Gly Tyr Arg Ala Gly Arg Thr Thr Lys Lys Lys

185 355 360 365

186 Ile Thr Gly Pro Ser Phe Met Gly Phe Leu Asn Ser Leu Val Ser Leu

187 370 375 380

188 Asn Ser Leu Tyr Ile Leu Ile Phe Val Ile Ala Leu Leu Phe Val Met

189 385 390 395 400

190 Val Tyr Phe Ala Gly Gln Asp Asp Thr Pro Gln Pro Arg Arg Ile His

191 405 410 415

192 Glu Glu

193 <210> SEQ ID NO 7

194 <211> LENGTH: 1774

195 <212> TYPE: DNA

196 <213> ORGANISM: Momordica charantia 

197 <400> SEQUENCE: 7 

198 gcacgaggag ccggatgcgg cggccggtgc ttccgctcat cgtcacctcc ccgactttga 60 

199 tggttttgag ggaggtgccg aggacgagga ttttggggac ttctccgatt ttgaggactc 120

200 ggatgctgat cgggatgagt acaaggcgcc ggaggtggac gagaaggatg tcgtcgtgtt 180

201 gaaggagggt aacttcagcg atttcgtgga gaagaaccgg tttgttatgg tggagtttta 240

202 cgctccctgg tgtggtcact gccaggcgct ggcgccggag tatgctgctg ccgccactga 300 

203 attgaaaggc gagaacgtgg ttttggcgaa ggttgatgcg acggaggaga atgaattgtc 360 

204 gcagaagtac gacgttcaag gatttccgac tgtttatttc tttgccgatg gagtccacaa 420 

205 gtcttaccct ggacagcgga ccaaggatgc tatagtaacc tggatcaaga agaagatcgg 480 

206 acctggtatt tacaacataa cttcggtgga agatgctgaa cgcatactga cttctgagac 540 

207 taaagttgtt cttggttacc tgaactcctt ggtgggccct gagagcaatg agcttgctgc 600 

208 tgcttcaaga ctggaagatg atgtcaactt ttaccaaacg gtggatcctg aagtggccaa 660

209 gcttttccac attgaagctt cagcaaaacg ccctgccttg gtattgctta agaaggaggc 720

210 tgaaaaactg aaccgctttg atggcgagtt ttctaagtct gcaattgctg aatttgtgtt 780

211 tgccaataag cttccattag ttacaaagtt tacgagagaa agcgcaccat tgattttcga 840 

212 aagttcaatt aagaaacagt tgattctatt tgcgatttca aatgattcag agaaactaat 900 

213 ccccatattt gaagagtcgt cgaagtcttt taaaggaaag cttattttcg tttatgtgga 960 

214 aattgacaat gaagatgttg gaaagccggt atcagaatac tttggcatta gtggcaatgg 1020

215 tccagaggtt cttggataca ctggaaatga ggacagcaag aaatttgtgc ttgctaagga 1080

216 agttactttg gataatatta aggctttcgg agaaaatttc ttggaagaca agttaaaacc 1140

217 cttttataag tcagatccca ttcctgagac taatgatggt gacgtgaaag tagtggttgg 1200 

218 agacaacttc gacaatattg ttttagatga atcgaaggat gttctcctcg agatctatgc 1260 

219 tccttggtgt gggcattgcc aagcactgga accaacttat aacaagcttg ccaaacattt 1320

220 acgtggcatc gattcacttg tcattgctaa gatggatggc acaacaaatg aacatccccg 1380 

221 ggcgaagtcc gatggattcc caacaattct gtttttccca gctggaaaca agagctttga 1440 

222 ccctatcact gtcgataccg atcgtaccgt tgtggcactg tacaaattca tcaagaaaaa 1500

223 tgcatccatc cctttcaagc tacagaagcc agtttcgagt ccgaaagccg taagttctga 1560 

224 agccaaatct ggtgatgcca aagagagccc aaagagcagc accactgacg taaaggatga 1620 

225 attgtgaaga cttcttaaat agttttgtaa gttattatcc catcttttat gcactttttg 1680 

226 cagctgccag atttttagac catatggaga gactagaaat taaaagaaaa tgtttttttc 1740 

227 cctttttctt taggaaaaaa aaaaaaaaaa aaaa 1774 

228 <210> SEQ ID NO 8 

229 <211> LENGTH: 541 

230 <212> TYPE: PRT 

231 <213> ORGANISM: Momordica charantia 

232 <400> SEQUENCE: 8

233 His Glu Glu Pro Asp Ala Ala Ala Gly Ala Ser Ala His Arg His Leu 

234 1 5 10 15

235 Pro Asp Phe Asp Gly Phe Glu Gly Gly Ala Glu Asp Glu Asp Phe Gly 

236 20 25 30 

237 Asp Phe Ser Asp Phe Glu Asp Ser Asp Ala Asp Arg Asp Glu Tyr Lys 

238 35 40 45 

239 Ala Pro Glu Val Asp Glu Lys Asp Val Val Val Leu Lys Glu Gly Asn 

240 50 55 60 

241 Phe Ser Asp Phe Val Glu Lys Asn Arg Phe Val Met Val Glu Phe Tyr 

242 65 70 75 80 

243 Ala Pro Trp Cys Gly His Cys Gln Ala Leu Ala Pro Glu Tyr Ala Ala

244 85 90 95 



1 

VERIFICATION SUMMARY DATE: 10/22/1999

PATENT APPLICATION US/09/417,251 TIME: 15:02:48

Input Set: 1417251. RAW 



Line  ?  Error/Warning                         Original Text

38    W  "N" or "Xaa" used: Feature required   gcgtaaggcc gctggtatca ttctacatga ttaagagt
39    W  "N" or "Xaa" used: Feature required   caaagggaac cctcgngggt ttaa
75    W  "N" or "Xaa" used: Feature required   tgtacgtgaa aaatggtcaa cantt






